Comparing three Critic Models of Reinforcement Learning in the Basal Ganglia Connected to a Detailed Actor in a S-R Task

نویسندگان

  • Mehdi Khamassi
  • Benoît Girard
  • Alain Berthoz
  • Agnès Guillot
چکیده

Actor-Critic architectures of reinforcement learning were found to show a strong resemblance with known anatomy and function of a part of the vertebrate's brain: the basal ganglia. Based on this analogy, a large number of Actor-Critic models were simulated to reproduce behaviours of rats performing laboratory tasks. However, most of these models were tested in different tasks and it is often difficult to compare their efficiency. The work presented here concerns the comparison of three Critics, tested with the same Actor part taking into account known basal ganglia anatomy. The specificities of the three Critics connected to this Actor lie on the absence or presence of a temporal representation of stimuli, and on the use of one or more units for the prediction of reward. These architectures are implemented in the same simulated robot performing the same stimulus-response (S-R) experiment: a reward-seeking task. Results show that both temporal representation of stimuli and multiple prediction units are mandatory for the achievement of the task. Improvements for Critic modelling and competing hypotheses for Actor-Critic models are finally discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

Since 1995, numerous Actor-Critic architectures for reinforcement learning have been proposed as models of dopamine-like reinforcement learning mechanisms in the rat’s basal ganglia. However, these models were usually tested in different tasks, and it is then difficult to compare their efficiency for an autonomous animat. We present here the comparison of four architectures in an animat as it p...

متن کامل

Actor-critic models of the basal ganglia: new anatomical and computational perspectives

A large number of computational models of information processing in the basal ganglia have been developed in recent years. Prominent in these are actor-critic models of basal ganglia functioning, which build on the strong resemblance between dopamine neuron activity and the temporal difference prediction error signal in the critic, and between dopamine-dependent long-term synaptic plasticity in...

متن کامل

Cognitive Science Honors Thesis A Computational Account of Sensory Prediction Error Gating in Reinforcement Learning Models

A successful return in tennis requires a tennis player first to determine where best to place her return and then to correctly execute her swing. If she makes an errant return, she now faces a credit assignment problem: Should this negative outcome be attributed to poor shot selection or to an error in motor execution? McDougle et al. propose a solution to this problem when the source of the er...

متن کامل

Integration of Reinforcement Learning and Optimal Decision-Making Theories of the Basal Ganglia

This article seeks to integrate two sets of theories describing action selection in the basal ganglia: reinforcement learning theories describing learning which actions to select to maximize reward and decision-making theories proposing that the basal ganglia selects actions on the basis of sensory evidence accumulated in the cortex. In particular, we present a model that integrates the actor-c...

متن کامل

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g., when function approximation is involved). Interestingly, there is growing evidence that actor-critic approaches based on phasic dopamine signals play a key role in biological learning through cortical and basal gangli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004